Abstract. Heavy flavour jet tagging is widely used in the determination of cross
sections including the production of heavy flavoured quarks. This requires the 
knowledge of heavy and light flavour jet tagging efficiencies and their uncertainties. 
A system of eight non-linear equations can be used to determine these quantities by 
means of two different tagging algorithms and two data sets which differ in heavy 
flavour jet content. The analytical solution of this system of equations, derived by
means of resultants, is described in detail, including a discussion of its singularities,
which provide important insights such as prescriptions to avoid badly chosen sample
flavour compositions and working points of the tagging algorithms. The analytical
solution also provides an efficient and transparent way to determine the uncertainties 
on the solved quantities, taking correlations into account. 
PACS numbers: 14.65.Fy, 14.65.Bt, 21.10.Tg

1. Introduction 

Heavy flavour jet tagging is widely used in the determination of cross sections including 
the production of heavy flavoured quarks. This encompasses interesting physics 
processes, such as top anti-top quark pair production, $Z \to b\bar{b}$ production, Higgs boson
production ($H \to b\bar{b}$), etc. The b quarks fragment into long-lived hadrons, whose decay
products manifest typically in the tracking system of a detector as displaced charged 
particle tracks. The energy deposits of particles are measured in the calorimeter of the 
detector. They are used to reconstruct jets, to which charged tracks can be associated. 

In simulation one can easily identify the quark flavour of a jet by checking if a 
heavy flavour quark or hadron is geometrically located inside the jet. In data one has 
to rely on reconstructed objects and quantities such as associated track displacements 
to determine the flavour content of a jet. Dedicated algorithms - referred to as tagging 
algorithms - have been developed to determine the flavour of jets based on their lifetime 
information. This procedure is not perfect and implies certain probabilities for the 
correct identification of jet flavours. The uncertainties on these probabilities enter 
among others into the systematics of cross section measurements of the physics processes 
mentioned above. Therefore it is indispensable to determine the heavy and light flavour
jet tagging efficiencies and their uncertainties as accurately as possible.

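System8 [1] provides an Ansatz which allows one to determine these quantities. It
consists of a system of eight algebraic equations

$$
\begin{aligned}
n      &= n_b + n_q, \\
n_T    &= \epsilon_T n_b + f_T n_q, \\
n_S    &= \epsilon_S n_b + f_S n_q, \\
n_{TS} &= \epsilon_T \epsilon_S n_b + f_T f_S n_q, \\
p      &= p_b + p_q, \\
p_T    &= \epsilon_T p_b + f_T p_q, \\
p_S    &= \epsilon_S p_b + f_S p_q, \\
p_{TS} &= \epsilon_T \epsilon_S p_b + f_T f_S p_q,
\end{aligned}
\tag{1}
$$
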

where $n$ is the number of jets in a given data sample. It consists of two subsamples, one
containing $n_b$ heavy flavour jets and the other containing $n_q$ light flavour jets.
The contribution of charm flavoured jets is predominantly absorbed
into the light flavour subsample. Similarly, another data sample $p$ can be subdivided
into its heavy and light flavour contributions $p_b$ and $p_q$. It is important that the two
data samples $n$ and $p$ differ in their flavour composition; otherwise the system of eight
equations degenerates to a system of four linearly independent equations which can
no longer be solved for its eight unknowns. The two data samples can share common
jets and are therefore in general not statistically independent. The efficiencies to tag
heavy flavour jets are denoted by $\epsilon$ and the efficiencies to tag light flavour jets,
also referred to as fake rates, by $f$. To determine these efficiencies and
fake rates, two different tagging algorithms $T$ and $S$, which have to be de-correlated [2],
are needed.
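
To make the structure of eqs. (1) concrete, the following minimal Python sketch evaluates their right hand sides for assumed values of the eight unknowns. The function name `system8_counts` and all numerical inputs are hypothetical illustrations, not values taken from the text. The second call shows the degenerate case in which both samples have the same flavour composition.

```python
def system8_counts(nb, nq, pb, pq, eT, fT, eS, fS):
    """Evaluate the eight observables of eqs. (1) for fully de-correlated taggers.

    Returns (n, nT, nS, nTS, p, pT, pS, pTS).
    """
    def sample(b, q):
        # total, T-tagged, S-tagged and double-tagged jet counts of one sample
        return (b + q,
                eT * b + fT * q,
                eS * b + fS * q,
                eT * eS * b + fT * fS * q)
    return sample(nb, nq) + sample(pb, pq)


# Hypothetical inputs: sample p is enriched in heavy flavour with respect to n.
counts = system8_counts(2000, 8000, 3000, 1000, eT=0.5, fT=0.1, eS=0.6, fS=0.2)
print(counts)

# Degenerate case: identical flavour fractions (20% heavy flavour) in both samples.
same = system8_counts(2000, 8000, 500, 2000, eT=0.5, fT=0.1, eS=0.6, fS=0.2)
ratios = [same[i + 4] / same[i] for i in range(4)]
print(ratios)  # the four ratios agree up to rounding
```

When the four ratios agree, the $p$ equations are a rescaled copy of the $n$ equations and carry no additional information, which is the degeneracy described above.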

The de-correlation requirement can be met by choosing two tagging algorithms
$T$ and $S$ which determine the flavour from different jet or
associated track quantities which are de-correlated. In practice one tagging algorithm
can be based on the lifetime information of a jet through the impact parameter
of associated tracks, e.g. by means of a secondary vertex [3] [4] tagger. Since muons in
jets are a signature of heavy flavour decay, an appropriate choice for the second tagging
algorithm is a muon tagger, which looks for muons in jets with a certain transverse
momentum with respect to the jet axis. In the following, it is assumed that the taggers
are fully de-correlated, in which case the unknown quantities can be obtained relying on
data only. Otherwise, correlation factors determined by simulation have to be introduced
into the system of eqs. (1).

The known quantities that appear in the system of equations (the number of jets
in each of the two samples before and after one or both taggers have been applied) are
on the left hand side of eqs. (1). The eight unknown quantities, which are the heavy
and light flavour jet content of the samples and the tagging efficiencies and fake rates,
appear on the right hand side.

The analytical solution of the system of equations will be derived in the next section,
followed by a discussion of its numerical stability and the determination of errors on
the solved quantities in section 3. An explicit example, showing the performance of the
method, is given in section 4.



2. Analytical solution 

Elementary algebraic operations permit one to reduce the system of eight equations to 
two equations of two unknowns, which are polynomials of multi-degree two. The choice 
of the two unknowns is not unique. Since the primary goal is to determine the heavy 
flavour jet tagging efficiency and the fake rate of a tagging algorithm T to be used in 
various data analyses, it is convenient to keep the two unknowns $\epsilon_T$ and $f_T$ in the two
polynomials. The other unknowns can be obtained by backward substitution. In general 
two multi-variate polynomials of two unknowns with arbitrary degree can be solved by 
means of resultants [5] [6]. The two polynomials can be written in the form 

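$$
\begin{aligned}
r_1 &:= a_0 f_T + a_1 = 0, \\
r_2 &:= b_0 f_T + b_1 = 0.
\end{aligned}
\tag{2}
$$

The coefficients $a_0$, $a_1$ and $b_0$, $b_1$ are polynomials in the unknown $\epsilon_T$. Their explicit
expressions are given by

$$
\begin{aligned}
a_0 &= a_{00}\,\epsilon_T + a_{01}, \\
a_1 &= a_{10}\,\epsilon_T + a_{11}, \\
b_0 &= b_{00}\,\epsilon_T + b_{01}, \\
b_1 &= b_{10}\,\epsilon_T + b_{11},
\end{aligned}
\tag{3}
$$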

where the coefficients can be expressed in terms of the initial known quantities as 

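$$
\begin{aligned}
a_{00} &= -n^2 p_S + n_S\, n\, p, \\
a_{01} &= n\, n_T\, p_S - n_S\, n_T\, p, \\
a_{10} &= a_{01}, \\
a_{11} &= n_T\, n_{TS}\, p - n_{TS}\, p_T\, n + n_S\, n_T\, p_T - n_T^2\, p_S
\end{aligned}
\tag{4}
$$

for the first polynomial and

$$
\begin{aligned}
b_{00} &= -n_S\, n_T\, p - p_{TS}\, n^2 + n_{TS}\, p\, n + n_S\, p_T\, n, \\
b_{01} &= -n_{TS}\, p_T\, n + p_{TS}\, n\, n_T, \\
b_{10} &= b_{01}, \\
b_{11} &= n_{TS}\, p_T\, n_T - p_{TS}\, n_T^2
\end{aligned}
\tag{5}
$$
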
for the second polynomial. The resultant with respect to $f_T$ in eqn. (2) can then be
obtained by equating the determinant of the Sylvester matrix [7]

$$
\mathrm{Res}(f_T) =
\begin{vmatrix}
a_0 & b_0 \\
a_1 & b_1
\end{vmatrix}
\tag{6}
$$






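to zero. The solution is a univariate polynomial of degree two which can be solved for
the first unknown

$$
\epsilon_T = -\frac{1}{2}\,\frac{a_{00}b_{11} - a_{11}b_{00}}{a_{00}b_{10} - a_{10}b_{00}}
+ \sqrt{\left(\frac{1}{2}\,\frac{a_{00}b_{11} - a_{11}b_{00}}{a_{00}b_{10} - a_{10}b_{00}}\right)^2
- \frac{a_{01}b_{11} - a_{11}b_{01}}{a_{00}b_{10} - a_{10}b_{00}}}\,.
\tag{7}
$$

The solution has a two-fold ambiguity originating from the initial system of equations,
which is symmetric with respect to the exchange of efficiencies and fake rates. In practice,
the larger solution given above (positive square root) corresponds to the efficiency. The
remaining unknowns can be obtained through backward substitution, starting with $f_T$
from $r_1 = 0$ in eqn. (2), and are given by

$$
f_T = -\frac{a_{10}\,\epsilon_T + a_{11}}{a_{00}\,\epsilon_T + a_{01}},
\tag{8}
$$

$$
\epsilon_S = \frac{n_{TS} - n_S f_T}{n_T - f_T\, n},
\tag{9}
$$

$$
f_S = \frac{n_S(\epsilon_T - f_T) + \epsilon_S (f_T\, n - n_T)}{n\,\epsilon_T - n_T},
\tag{10}
$$

$$
n_b = \frac{n_T - f_T\, n}{\epsilon_T - f_T},
\tag{11}
$$

$$
n_q = n - n_b,
\tag{12}
$$

$$
p_b = \frac{p_T - f_T\, p}{\epsilon_T - f_T}
\tag{13}
$$

and

$$
p_q = p - p_b.
\tag{14}
$$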
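
As a cross-check of the closed-form solution, eqs. (3)-(14) can be transcribed into a short Python sketch. The function name `solve_system8` and its argument names are assumptions made for illustration. The coefficients of eqs. (4) and (5) are evaluated in exact integer arithmetic when integer counts are passed, and $f_T$ is obtained from $r_1 = 0$ in eqn. (2). The input counts are those of the example in section 4.

```python
from math import sqrt


def solve_system8(n, nT, nS, nTS, p, pT, pS, pTS):
    """Closed-form solution of the de-correlated system of eight equations."""
    # Coefficients of the two polynomials, eqs. (4) and (5).
    a00 = -n * n * pS + nS * n * p
    a01 = n * nT * pS - nS * nT * p
    a10 = a01
    a11 = nT * nTS * p - nTS * pT * n + nS * nT * pT - nT * nT * pS
    b00 = -nS * nT * p - pTS * n * n + nTS * p * n + nS * pT * n
    b01 = -nTS * pT * n + pTS * n * nT
    b10 = b01
    b11 = nTS * pT * nT - pTS * nT * nT

    # Resultant (6) set to zero: k2*eT**2 + k1*eT + k0 = 0.
    # The term a01*b10 - a10*b01 in k1 vanishes because a10 = a01, b10 = b01.
    k2 = a00 * b10 - a10 * b00
    k1 = a00 * b11 - a11 * b00
    k0 = a01 * b11 - a11 * b01

    half = 0.5 * k1 / k2
    eT = -half + sqrt(half * half - k0 / k2)   # eq. (7), larger root
    fT = -(a10 * eT + a11) / (a00 * eT + a01)  # from r1 = 0 in eq. (2)

    # Backward substitution, eqs. (9)-(14).
    eS = (nTS - nS * fT) / (nT - fT * n)
    fS = (nS * (eT - fT) + eS * (fT * n - nT)) / (n * eT - nT)
    nb = (nT - fT * n) / (eT - fT)
    pb = (pT - fT * p) / (eT - fT)
    return dict(eT=eT, fT=fT, eS=eS, fS=fS, nb=nb, nq=n - nb, pb=pb, pq=p - pb)


sol = solve_system8(758925, 73076, 376891, 49810, 11082, 2406, 7198, 1778)
for name, value in sol.items():
    print(f"{name} = {value:.4g}")
```

For these inputs the sketch should reproduce the central values quoted in section 4, for instance $\epsilon_T \approx 0.298$ and $f_T \approx 0.026$. It does not protect against the vanishing denominators discussed in section 3.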

In the more general case where arbitrary correlation factors are taken into account in
the system of equations, some terms do not cancel and the resultant yields
a univariate polynomial of degree eight, which can no longer be solved analytically.
It could still be solved semi-analytically by means of Sturm's theorem, which gives
the number of real roots of a given univariate polynomial of arbitrary degree in a given
interval [5]. The roots would need to be polished, e.g. by binary bracketing, and one
would need to take care of the increased number of solution ambiguities.



3. Numerical stability and error propagation 

The solution of the initial system of eight equations contains three singularities due to
vanishing denominators, which occur if the working points are not chosen carefully. The first
singularity occurs in the denominator of eqn. (9), which determines the efficiency $\epsilon_S$. This singularity,
of the type $(n_T - n f_T)$, appears if all tagged jets are of light flavour and have
been misidentified as heavy flavour jets. This would certainly correspond to an ill-posed
initial condition which prohibits the sensible use of a given tagging algorithm. The
second singularity occurs in the denominator of eqn. (10), which determines the fake rate
$f_S$. This singularity, of the type $(n_T - n\epsilon_T)$, appears if all tagged jets are of heavy flavour.
This particular case corresponds to a perfect algorithm, for which the problem could be solved with just
four equations. In practice it cannot be avoided that some of the tagged jets are
misidentified. The fake rate of optimised tagging algorithms depends on the working
point, which is typically chosen at the percent level (see e.g. [8]), far away from zero,
the point at which, within floating-point precision, the implementation of the analytical solution loses
its predictive power. The last singularity occurs in the denominator of eqs. (11) and (13).
There, the singularity of the type $(\epsilon_T - f_T)$ corresponds to the case of equal efficiency and
fake rate. The efficiency and fake rate of a given tagging algorithm could be determined
with a system of four equations, but one would not be able to determine the heavy and
light flavour content of the sample. This working point is far away from typically chosen
settings, which are driven by the goal of having a maximal efficiency at minimal fake
rate. To avoid the problematic regions around the singularities one has to choose
working points of the taggers which avoid small or even vanishing denominators. This
can be achieved by probing different working points and verifying the values of the
denominators.
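
A minimal Python sketch of such a check is given below: the helper evaluates the three denominators discussed above and flags those that are small. The function name `singular_denominators`, the normalisation of the count-valued denominators by $n$, and the threshold `tol` are assumptions chosen for illustration, not prescriptions from the text.

```python
def singular_denominators(n, nT, eT, fT, tol=0.01):
    """Return the three critical denominators and whether each is close to zero.

    The first two are divided by n so that all three are dimensionless and can
    be compared with a single (arbitrary) threshold tol.
    """
    denominators = {
        "eps_S: n_T - n f_T": (nT - n * fT) / n,
        "f_S: n_T - n eps_T": (nT - n * eT) / n,
        "n_b, p_b: eps_T - f_T": eT - fT,
    }
    return {name: (value, abs(value) < tol) for name, value in denominators.items()}


# Rounded central values of the example in section 4.
check = singular_denominators(n=758925, nT=73076, eT=0.298, fT=0.026)
for name, (value, too_small) in check.items():
    print(f"{name:24s} {value:+.4f} {'TOO SMALL' if too_small else 'ok'}")
```

In a scan over working points one would call such a helper for every candidate setting and discard those for which any flag is set.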

The errors of the analytical solution are obtained through Gaussian error
propagation taking all correlations into account. The errors of
subsequently solved quantities, obtained through backward substitution, are evaluated
accurately by error propagation through differentials as explained below. In this way the
errors of the subsequently solved quantities retain all their dependencies on the knowns,
such that terms with different signs compensate before the Gaussian quadrature takes
place.

The errors of the knowns entering into the error propagation are the errors of the
data samples $n$ and $p$, which are assumed to be Poisson distributed. The errors of
the tagged subsamples ($n_T$, $p_T$, $n_S$, $p_S$, $n_{TS}$ and $p_{TS}$) are taken as binomial since the
subsamples have been obtained by applying a binary acceptance/rejection procedure
through the taggers. They can be computed from the known variables as

$$
\begin{aligned}
\Delta n      &= \sqrt{n}, \\
\Delta n_T    &= \sqrt{n_T (n - n_T)/n}, \\
\Delta n_S    &= \sqrt{n_S (n - n_S)/n}, \\
\Delta n_{TS} &= \sqrt{n_{TS} (n - n_{TS})/n}
\end{aligned}
\tag{15}
$$

and equivalently for the data sample p. The variance of the tagging efficiency is then 
given by 

$$
(\Delta\epsilon_T)^2 = \sum_{i} \left(\frac{\partial\epsilon_T}{\partial x_i}\right)^2 (\Delta x_i)^2
+ 2 \sum_{i<j} \frac{\partial\epsilon_T}{\partial x_i}\,\frac{\partial\epsilon_T}{\partial x_j}\,
\varrho_{x_i x_j}\,\Delta x_i\,\Delta x_j ,
$$

where the $x_i$ run over the eight known quantities $n$, $n_T$, $n_S$, $n_{TS}$, $p$, $p_T$, $p_S$ and $p_{TS}$,
and the second sum runs over all distinct pairs of them,

and equivalently for the fake rate $f_T$ and the other solved quantities. The correlation
coefficient $\varrho$ of two variables $n_a$ and $p_b$ can be determined by means of the relation

^Pi, = — — K n p b — 16 

nU p nU p 

where $n \cup p$ are the jets belonging to the sample $n$ or $p$ and $n_a \cap p_b$ are the jets in common
between the subsamples $n_a$ and $p_b$. It should be noted that the total number of jets
per event could also be exploited in the definition of the correlation coefficient. Given
the fact that the correlation terms are in general small, the definition above is used in
the following without further discussion. To keep the numerical evaluation efficient, the
derivatives are computed exploiting differentials ($d\epsilon_T$, $df_T$, $d\epsilon_S$, $df_S$, $dn_b$, $dn_q$, $dp_b$ and
$dp_q$). The derivative of the tagging efficiency with respect to the data sample size $n$ is
then given by

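$$
\begin{aligned}
\frac{\partial\epsilon_T}{\partial n} ={}&
\frac{\partial\epsilon_T}{\partial a_{00}}\frac{\partial a_{00}}{\partial n}
+ \frac{\partial\epsilon_T}{\partial a_{01}}\frac{\partial a_{01}}{\partial n}
+ \frac{\partial\epsilon_T}{\partial a_{10}}\frac{\partial a_{10}}{\partial n}
+ \frac{\partial\epsilon_T}{\partial a_{11}}\frac{\partial a_{11}}{\partial n} \\
&+ \frac{\partial\epsilon_T}{\partial b_{00}}\frac{\partial b_{00}}{\partial n}
+ \frac{\partial\epsilon_T}{\partial b_{01}}\frac{\partial b_{01}}{\partial n}
+ \frac{\partial\epsilon_T}{\partial b_{10}}\frac{\partial b_{10}}{\partial n}
+ \frac{\partial\epsilon_T}{\partial b_{11}}\frac{\partial b_{11}}{\partial n}
\end{aligned}
\tag{17}
$$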
and similarly for the other variables. Some of the right hand terms vanish, but most of
them are always different from zero. Which terms vanish depends on the solved variable
and the derivative which is being considered. To determine the error on a subsequent
quantity like the tagging efficiency of the auxiliary tagger $S$ (eq. (9)) accurately, its
dependences on previously solved-for quantities are substituted to obtain a function of
the form

$$
\epsilon_S = \epsilon_S(a_{00}, a_{01}, a_{10}, a_{11}, b_{00}, b_{01}, b_{10}, b_{11}).
\tag{18}
$$

The derivatives can then be determined as above to compute the final error on this
quantity. This way of determining the errors also has the advantage that their magnitude
does not depend on the order in which the unknowns have been solved.
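
The propagation described in this section can be illustrated numerically. The following Python sketch is not the differential-based chain rule of eq. (17): as a simpler stand-in it estimates the derivatives of $\epsilon_T$ by central finite differences. It also takes $\epsilon_T$ as the larger root of the quadratic obtained by eliminating $\epsilon_S$ between the two samples, a compact form that is algebraically equivalent to eq. (7) but is not the one used in the text. The function names and the optional correlation matrix `rho` are assumptions; when `rho` is omitted all correlation terms are dropped, which the text does not do.

```python
from math import sqrt


def eps_T(n, nT, nS, nTS, p, pT, pS, pTS):
    """Larger root of the quadratic A x^2 + B x + C = 0 whose roots are eps_T and f_T."""
    A = nS * p - n * pS
    B = n * pTS + nT * pS - nTS * p - nS * pT
    C = nTS * pT - nT * pTS
    disc = sqrt(B * B - 4.0 * A * C)
    return max((-B + disc) / (2.0 * A), (-B - disc) / (2.0 * A))


def count_errors(x):
    """Eq. (15): Poisson errors on n and p, binomial errors on tagged subsamples."""
    n, nT, nS, nTS, p, pT, pS, pTS = x

    def binom(k, total):
        return sqrt(k * (total - k) / total)

    return [sqrt(n), binom(nT, n), binom(nS, n), binom(nTS, n),
            sqrt(p), binom(pT, p), binom(pS, p), binom(pTS, p)]


def gradient(func, x, rel_step=1e-6):
    """Central finite-difference estimate of the partial derivatives of func."""
    grad = []
    for i, xi in enumerate(x):
        h = rel_step * xi
        up, down = list(x), list(x)
        up[i], down[i] = xi + h, xi - h
        grad.append((func(*up) - func(*down)) / (2.0 * h))
    return grad


def propagated_error(func, x, rho=None):
    """Gaussian error propagation; rho[i][j] are assumed correlation coefficients."""
    g, d = gradient(func, x), count_errors(x)
    var = 0.0
    for i in range(len(x)):
        for j in range(len(x)):
            r = 1.0 if i == j else (rho[i][j] if rho is not None else 0.0)
            var += g[i] * g[j] * r * d[i] * d[j]
    return sqrt(var)


# Input counts of the example in section 4, ordered (n, nT, nS, nTS, p, pT, pS, pTS).
counts = [758925, 73076, 376891, 49810, 11082, 2406, 7198, 1778]
err = propagated_error(eps_T, counts)
print(f"uncorrelated error on eps_T: {err:.4f}")
```

Because the correlations are neglected here, the printed value is not expected to coincide with the error quoted in section 4. Passing a symmetric 8 x 8 matrix as `rho` adds the cross terms.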



4. Performance 



The system of equations does not address the charm flavour content of the samples. Other methods,
like fitting the relative transverse momentum distribution of muons in jets to
templates of different jet flavour contents (b, c and light), rely on simulation, and the
c and light flavour templates are typically difficult to distinguish [9]. Within
the system of eight equations the c flavour content is predominantly absorbed into the
light flavour subsample. In this way the method is able to disentangle self-consistently
the heavy and light flavour content of two given samples. However, it is not possible to
distinguish between light and c flavour contents. More details about this problem can
be found in [2].

As a concrete example, let us consider two data samples $n$, $p$ and their corresponding
tagged subsamples

$$
\begin{aligned}
n      &= 758925, \\
n_T    &= 73076, \\
n_S    &= 376891, \\
n_{TS} &= 49810, \\
p      &= 11082, \\
p_T    &= 2406, \\
p_S    &= 7198, \\
p_{TS} &= 1778.
\end{aligned}
\tag{19}
$$



The solution of the unknown quantities and their errors, determined as explained
in the last section, are given by

$$
\begin{aligned}
\epsilon_T &= 0.298 \pm 0.032, \\
f_T        &= 0.026 \pm 0.005, \\
\epsilon_S &= 0.751 \pm 0.007, \\
f_S        &= 0.408 \pm 0.016, \\
n_b        &= 195691 \pm 25645, \\
n_q        &= 563234 \pm 24670, \\
p_b        &= 7794 \pm 978, \\
p_q        &= 3288 \pm 986.
\end{aligned}
\tag{20}
$$

This example shows typical numbers for a lifetime based tagger $T$ to be probed and
an auxiliary de-correlated tagger $S$, needed to solve the system of equations. The
denominators

$$
\begin{aligned}
n_T - n f_T        &= 53344, \\
n_T - n\epsilon_T  &= -153084, \\
\epsilon_T - f_T   &= 0.272
\end{aligned}
\tag{21}
$$

are far enough from singularities to ensure reasonably small errors on the solved
quantities. If the initial conditions are chosen such that the solution approaches
a singularity, the computed errors diverge, indicating that one is attempting to solve
an ill-posed problem. In this way one is protected against the misuse of poorly
adjusted working points of the taggers and badly chosen data samples. To minimize
the errors it is convenient to use working points of the tagging algorithms which are
far away from the singularities. In addition, the heavy flavour contents of the two data
samples should not be too similar, which would be the case if they were
consistent with each other within errors. In practice the two
data samples can be realized by means of two different trigger requirements, one for
each sample. An appropriate choice is jet-only triggers on the one hand and
muon-plus-jets triggers on the other. Since muons in jets indicate the presence of heavy
flavour decays, the latter category of triggers provides a sample enhanced in heavy
flavour content with respect to the former.

Figure 1. The relative error of the tagging rate $\epsilon_S$ as a function of the denominator
$n_T - n f_T$, which is needed for the computation of $\epsilon_S$. The scan over the singularity has
been achieved by varying the known quantity $n_{TS}$. Each dot corresponds to a solution
for one value of $n_{TS}$. The interpolation function has been obtained with cubic splines.

To demonstrate the behaviour of initial conditions approaching a singularity, the
input values of the example given by eqs. (19) have been taken, with the exception of
the number of double tagged jets $n_{TS}$, which has been varied. Fig. 1 shows the
relative error $\Delta\epsilon_S/\epsilon_S$ as a function of the denominator $(n_T - n f_T)$ on which the tagging
efficiency $\epsilon_S$ depends. The values of the varied variable $n_{TS}$ are also indicated. The
relative error $\Delta\epsilon_S/\epsilon_S$ exceeds unity while approaching the singularity. The relative
error decreases monotonically from the singularity to the edges of the allowed
phase space $0 < n_{TS} < \min(n_T, n_S)$. Pseudoexperiments have been conducted to
verify that the computed efficiency $\epsilon_S$ is also Gaussian distributed in the vicinity of
the singularity, around two different central values: $n_{TS} = 35945$, for which $\Delta\epsilon_S/\epsilon_S \approx 1$,
and $n_{TS} = 36200$, for which $\Delta\epsilon_S/\epsilon_S \approx 10$. Furthermore the determined errors of the
tagging efficiency and fake rate of the probed tagger have been checked for coverage by
means of pseudoexperiments where the known quantities have been varied within their
errors (see e.g. eqns. (15) for the errors of the sample $n$ and corresponding subsamples).
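
The approach to the singularity can be reproduced with a few lines of Python. The sketch below varies $n_{TS}$ while keeping the other inputs of the example fixed and prints the denominator $n_T - n f_T$; it does not compute the relative error shown in Fig. 1. The helper names are hypothetical, and $\epsilon_T$ and $f_T$ are taken as the two roots of the quadratic obtained by eliminating $\epsilon_S$ between the two samples, a compact restatement of the solution that is not the form used in the text. A short calculation, also not given in the text, indicates that the denominator vanishes at $n_{TS} = n_T n_S / n$, where the two taggers look independent in sample $n$.

```python
from math import sqrt


def tagger_T_rates(n, nT, nS, nTS, p, pT, pS, pTS):
    """Return (eps_T, f_T) as the larger and smaller root of A x^2 + B x + C = 0."""
    A = nS * p - n * pS
    B = n * pTS + nT * pS - nTS * p - nS * pT
    C = nTS * pT - nT * pTS
    disc = sqrt(B * B - 4 * A * C)
    roots = ((-B + disc) / (2 * A), (-B - disc) / (2 * A))
    return max(roots), min(roots)


# Inputs of the example in section 4; only n_TS is varied.
n, nT, nS = 758925, 73076, 376891
p, pT, pS, pTS = 11082, 2406, 7198, 1778


def denominator(nTS):
    """Denominator n_T - n f_T of the expression for eps_S."""
    eT, fT = tagger_T_rates(n, nT, nS, nTS, p, pT, pS, pTS)
    return nT - n * fT


for nTS in (49810, 40000, 37000, 36500, 36000, 35000):
    print(nTS, round(denominator(nTS)))

nTS_singular = nT * nS / n
print(f"denominator expected to vanish near n_TS = {nTS_singular:.0f}")
```

The sign change of the printed denominator brackets the singular point, which lies close to the two central values probed with pseudoexperiments.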

Different working points of the taggers can be probed to maximize the denominators 
and minimize the errors on the most important quantities of interest, which are the 
efficiency and fake rate of the probed tagger. Their errors enter among others into the 
systematics of cross section measurements and limit calculations of searches for new 
physics where heavy flavour jets are involved. 

5. Conclusions 

An algebraic way of determining heavy and light flavour jet tagging efficiencies has been 
discussed. The analytical solution of a system of eight non-linear equations has been 
obtained by means of resultants. Its singularities suggest prescriptions to prevent badly 
chosen sample flavour compositions and working points of the used tagging algorithms. 
Errors are obtained by Gaussian error propagation, taking correlations between the
samples and subsamples into account. They diverge as one approaches the singularities,
making the method robust against its usage at badly chosen working points and allowing
for optimisation.